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An evolutionarily stable strategy (ESS) is an equilibrium strategy 
that is immune to invasions by rare alternative ( "mutant" ) strategies. 
Unlike Nash equilibria, ESS do not always exist in finite games. In 
this paper we address the question of what happens when the size 
of the game increases: does an ESS exist for "almost every large" 
game? Letting the entries in the n x n game matrix be independently 
randomly chosen according to a distribution F, we study the number 
of ESS with support of size 2. In particular, we show that, as n ^ 
oo, the probability of having such an ESS: (i) converges to 1 for 
distributions F with "exponential and faster decreasing tails" (e.g., 
uniform, normal, exponential); and (ii) converges to 1 — l/\/e for 
distributions F with "slower than exponential decreasing tails" (e.g., 
lognormal, Pareto, Cauchy). 

Our results also imply that the expected number of vertices of the 
convex hull of n random points in the plane converges to infinity for 
the distributions in (i), and to 4 for the distributions in (ii). 

1. Introduction. The concept of evolutionarily stable strategy (ESS for 
short), introduced by Maynard Smith and Price [12], refers to a strategy 
that, when played by the whole population, is immune to invasions by rare 
alternative ("mutant") strategies (see Section 2.1 for precise definitions). 
Formally, an ESS corresponds to a symmetric Nash equilibrium that satisfies 
an additional stability requirement. Every (symmetric) finite game has a 
(symmetric) Nash equilibrium. But the same is not true for ESS: there are 
games with finitely many pure strategies that have no ESS. Moreover, the 
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nonexistence of ESS is not an "isolated" phenomenon: it holds for open sets 
of games. ^ 

This leads us to the question of what happens when the number of strate- 
gies is large: does an ESS exist for "almost every large game" ? Specifically, 
assuming that the payoffs in the game are randomly chosen (they are inde- 
pendent and identically distributed random variables), what is the proba- 
bility that an ESS exists, and what is the limit of this probability as the size 
of the game increases? 

For pure ESS, the answer to this question is simple: the probability that a 
pure ESS exists is 1 — (1 — l/n)", which converges to 1 — 1/e ~ 63% as n — > oo, 
where n is the number of strategies. What about mixed ESS? Here we study 
mixed ESS with support of size 2 — called ''^two-point ESS" — and find out 
that, unlike pure ESS, the answer depends on the underlying distribution F 
from which the payoffs are drawn. 

By way of illustration, consider the family of cumulative distribution func- 
tions Fa{x) = 1 — e~^" for all x>0, where q > 0. Our result is: 

• When a > 1 the probability that there is a two-point ESS converges to 1 
as"^ n oo. 

• When a < 1 the probability that there is a two-point ESS converges to 
1 - 1/^^39% as^ n^oo. 

Moreover, we show that the distribution of the number of two-point ESS 
converges to a Poisson distribution, with a parameter converging to infinity 
when a > 1, and with a parameter of 1/2 when a < 1. 

This threshold phenomenon is not restricted to the class F^. We identify 
two classes of distributions. The first is a class of "light-tailed" distributions 
with tail probabilities 1 — F{x) that decrease exponentially as x — > oo (i.e., 
exponential distributions) or faster (e.g., normal distributions, uniform dis- 
tributions on bounded intervals, logistic distributions); they all lead to the 
same result as F^ for a > 1. The second is a class of "heavy-tailed" dis- 
tributions with tail probabilities that decrease slower than exponentially as 
X — > oo (including, in particular, the following distributions: Pareto, Cauchy, 
lognormal, stable with parameter less than 2), which all behave like F^ for 
a < 1. We refer to these two classes, respectively, as SJ-' for ^^Exponential 
and Faster decreasing tails," and S£ for ^^Slower than Exponential decreas- 
ing tails" (see Sections 4 and 5 for precise definitions). 



■^For instance, the "rock-scissors-paper" game of Example 9.2.1 in van Damme [15], 
and all its small enough perturbations, have no ESS. 

^So a fortiori the probability that an ESS exists converges to 1 in this case. 

*We also show in this case that the probability that there is either a pure or a two-point 
ESS converges to 1 - e"^/^ ~ 78%. 
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An interesting consequence of our results concerns the classic problem of 
the number of vertices of the convex hull of a collection of random points in 
the plane, originally studied by Renyi and Sulanke [13]; see Section 3. Tak- 
ing symmetric versions of the distributions^ Fa, and assuming that the 2n 
coordinates of the n points in the plane are independent and Fa-distributed, 
we have: 

• When a > 1 the expected number of vertices of the convex hull of n 
random points in the plane converges to infinity as oo. 

• When a < 1 the expected number of vertices of the convex hull of n 
random points in the plane converges to 4 as n — > oo. 

In addition, in the second case a < 1, the number of vertices converges in 
probability to 4; thus, the convex hull is a quadrilateral with probability 
converging to 1. Here again, the results hold for the general classes J-£ and 
S£, respectively. 

The paper is organized as follows. The two classes of distributions are 
defined in Sections 4.1 and 5.1, respectively. Our main results for ESS are 
stated in Theorems 1 and 2 in Section 2.2 (see also Theorem 17 in Section 4.2 
and Theorem 33 in Section 5.3), and, for the number of vertices, in Theorem 
10 in Section 3. Section 2 presents the model — ESS and random games — 
together with some preliminary results. Section 3 deals with the number of 
vertices of random polygons. The detailed analysis is provided in Sections 4 
and 5, and we conclude with a discussion in Section 6. 

2. Preliminaries. 

2.1. Evolutionarily stable strategies. The setup is that of a symmetric 
two-person game, with the payoffs given by the n x n matrix R = 
{R{i,j))i.j=i^...,n- The interpretation is that a meeting between two players, 
the first playing the pure strategy i and the second playing the pure strategy 
j (where 1 < i,j < n), yields a payoff of R{i,j) to the first, and R{j,i) to 
the second (these payoffs may be viewed as a measure of "fitness" or "re- 
productive success").^ A mixed strategy p is a probability vector on the set 
of pure strategies, that is, p = (pi, . . . ,pn) G A(n) := {x G M" : J2^=i = 1}; 
the payoff function R is bilinear ly extended to pairs of mixed strategies: 

R{p,q)■■=l:l=lT.';=lP^qjR{hj)■ 

A mixed strategy p G A{n) is an evolutionarily stable strategy (ESS) for 
the matrix R if it satisfies the following conditions (Maynard Smith and 
Price [12]): 



■■^That is, Fc{x) = (l/2)e~l"=l° for x < and Fa{x) = 1 - (l/2)e"''° for x > [a distri- 
bution F is symmetric if F{—x) = 1 — F{x) for all x]. 

^Thus the payoff matrix of the first player is R, and that of the second player is , 
the transpose of R. 
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[ESSl] R{p,p) > R{q,p) for all q G A(n). 

[ESS2] li q p satisfies R{q,p) = R{p,p), then R{q,q) < R{p,q). 

This definition is equivalent to the requirement that for every qj^p there 
exists an "invasion barrier" b{q) > such that R{p, (1 — e)p + eq) > R{q, (1 — 
e)p + eg) for all e G (0, 6(g)). The interpretation of this inequality is that 
any small enough proportion e [i.e., less than b{q)] of g- mutants cannot 
successfully invade a p-population, since the mutants' (average) payoff is 
strictly less than that of the existing population. 

An ESS p is called an (.-point ESS if the support supp(p) = {i:pi> 0} 
of p is of size L In particular, when £ = 1 we have a pure ESS. In the 
biological setup, i = 1 corresponds to "monomorphism," and ^ > 1 to 
allele polymorphism." Let S'^"'* = S'^"'* {R) be the number of £-point ESS for 
the matrix R. 

2.2. ESS of random games. Let F be a cumulative distribution function 
on M. We will assume throughout this paper that F is continuous with a 
support (a, 6) that is either finite or infinite (i.e., — oo < a <b < oo). For 
every integer n > 1, let = i?^") be an n x n matrix whose elements are 
independent F-distributed random variables; the number of ^-point ESS of 
is now a random variable S^"^ . 

We use the following notation: E for expectation; C{Z) for the distribu- 
tion function of the random variable Z; Poisson(A) for the Poisson distri- 
bution with parameter A [i.e., C{Z) = Poisson(A) if P{Z = k) = e~^\'^ /k\ 
for all integers /c > 0]; and the convergence of distributions is with respect 
to the variation norm [i.e., the Zi-norm on measures: ||£(Zi) — £(^2)11 = 
|P(-Z^i = k) — P{Z2 = k)\]. The two classes of distributions, namely, the 
"exponential and faster decreasing tails" class SJ- and the "slower than ex- 
ponential decreasing tails" class S£, will be formally defined in Sections 4.1 
and 5.1, respectively. 

(n) 

We now state our main results on 5*2 , the number of two-point ESS: 

Theorem 1. IfF^ET, then, asn^oo: 

(i) /i„:=E(5^"))^oo; 

(ii) [[^(Sg"^) — Poisson(/i„)|| ^0; and 

(iii) P {there is a two-point ESS) 1. 

Theorem 2. IJFgSS, then, asn^oo: 

(i) /x„:=E(5("))^l/2; 

(ii) ||/:(55"^) - Poisson(l/2)|| ^ 0; and 

(iii) P{there is a two-point ESS) 1 - e^^/^ ~ 0.39. 
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For the convergence to Poisson distributions (ii) we will use a result of 
the so-called "Chen-Stein method" that requires estimating only the first 
two moments (see Section 2.5); surprisingly, our proofs in the two cases are 
different. As for (iii), they are immediate from (ii). The two theorems are 
proved in Sections 4 and 5, respectively. Note that, for distributions in ET ^ 
Theorem l(iii) implies that the probability that there is an ESS converges 
to 1 [see Section 6(c)]. 

Returning to the definition of ESS in Section 2.1, condition [ESSl] says 
that p is a best reply to itself, that is, (p,p) is a Nash equilibrium. By the 
bilinearity of i?, it is equivalent to: R{i,p) = R{p,p) for all i G supp(p), and 
R{j,p) < R{p-,p) for all j ^ supp(p). Since F is a continuous distribution, it 
follows that, with probability 1, the inequalities are strict, that is, R{j,p) < 
R{p, p) for all j ^ supp(p) [the jth. row is independent of the rows in supp(p)] . 
Therefore, there are no best replies to p outside the support of^ p, that is, 
R{(liP) = R{P-,P) if and only if supp(g) C supp(p). Thus condition [ESS2] 
applies only to such q, and we obtain (see Haigh [10]): 

Lemma 3. For a random matrix R, the following hold a.s.: 

(i) i is a pure ESS if and only if R{i, i) > R{j, i) for all j / i. 

(ii) There is a two-point ESS with support {i,j} if and only if there exist 
Pi,Pj > such that PiR{i,i) +pjR{i,j) =piR{j,i) +PjR{j,j) > PiR{k,i) + 
PjR{k,j) for all k / and R{i,i) < R{j,i) and R{j,j) < R{i,j). 

The following is immediate from (i) (see Haigh [10]): 

Proposition 4. S^\ the number of pure ESS, is a Binomial(n, 1/n) 
random variable, and thus £(S'|"'*) — > Poisson(l) as n ^ oo. 

Proof. S^"^ = X^iLi C« where Ci is the indicator that i is a pure ESS, 
that is, R{i,i) > R{j,i) for all j ^ i, and so P(Ci = 1) = 1/n. □ 

For two-point ESS, we can express their number S^^ as a sum of n(n — 
l)/2 identically distributed indicators, 

= ^ Dij, 

l<i<j<n 

(n) 

where Dij = is the indicator that columns i,j provide a two-point 
ESS.^ To study the asymptotic behavior of 5*2"^ we will need to evaluate 



'^So is a quasi-strict Nash equilibrium. 

^Lemma 3(ii) implies that, a.s., for each i^ j there can be at most one ESS with 
support {i,j} (in fact, condition [ESS2] implies that the supports of two distinct ESS p 
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the first two moments (see Section 2.5), namely, P{Dij = 1) = P(Di2 = 
1) and P(Ai = Aj' = 1) = P(^i2 = Di3 = 1) (when {i,j} and {i',j'} are 
disjoint, Dij and D^'j' Siie independent, since Dij is a function of the entries 
in columns i and j only). 

2.3. First moment. The event that D12 = 1 depends only on the en- 
tries in the first two columns of the matrix R, which we will denote Xi = 
R{i, 1) and Yi = R{i, 2). Thus Xi,. . . , Xn, Yi, . . . ,Yn are 2n independent F- 
distributed random variables. For each i, let Pi := {Xi,Yi) be the corre- 
sponding point in M^. The two points Pi and P2 are almost surely distinct, 
and thus determine a line Ax + By = C through them, where^ 

(1) A:=Yi-Y2, B:=X2-Xi, C := X2Y1 - X1Y2. 

Finally, we denote by F = F*^"^ the event that there is a two-point ESS with 
support {1,2}, that is, D12 = 1; recalling Lemma 3(ii), we have 

F = f(") := {Xi <X2,Yi> Y2,AXk + BY^ < C for ah /c = 3, . . . , n}. 

Let fj-n :=E(52"'^) denote the expected number of two-point ESS. Then 

(2) Mn=(2)p(r(")). 

We now define an auxiliary random variable U = U^'^\ a function of Pi 
and P2, as follows: 

(3) u.^l P(^^3 + BY3 > C|Pi, P2), if Xi < X2 and Yi > Y2, 

\ 1, otherwise, 

where A,B and C are determined as above (1) by Pi and P2. Thus U is the 
probability that an independent point lies above the line through Pi and P2 
when Xi < X2 and Yi > I2 • Let Fjj be the cumulative distribution function 
of U [note that Fuil') = P(Xi < X2,Yi > Y2) = 1/4]. We have 

Lemma 5. 

P{T)= [\l-ur-^dFuiu). 
Jo 

Proof. Immediate since U is determined by Pi and P2, and for all > 3 
the points Pfc are independent of U and P{AXk+BYk > C\Pi,P2) = U (the 
atom at n = 1 does not matter since the integrand vanishes there) . □ 



and p' can never be comparable, i.e., neither supp(p) C supp(p') nor supp(p) D supp(p') 
can hold). 

^A,B and C are thus random variables that are functions of Pi and P2. 
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Corollary 6. 



p(i)i2 = i) = P(r) 



(n 



2) / {l-u)''~^Fu{u)du 



Jo 



Proof. Integrate by parts 



/ {l-ur-^dFu{u) 



[(1 



Jo 




Jo 



and note that the first term vanishes. □ 

2.4. Second moment. To evaluate P(L'i2 = -D13 = 1), we need the entries 
in the third column of the matrix R as well. Let Zi = R{i,3) be n random 
variables that are F-distributed, with all the Xi,Yi,Zi independent. Let T' 
be the event that Dis = 1 (we will use ' for the XZ-problem), that is, 

r' := {Xi < X3, Zi > Z3, A'Xk + B'Zk < C for all A; / 1, 3}, 

where A',B' and C" are determined by P{ = {Xi,Zi) and P3 = (X^jZ^) 
[cf. (1)]. Let U' be the corresponding random variable: U' := F{A'X2 + 
B'Y2 > C'\Pi,P^) if Xi < Xs and Zi > Z3, and U' := 1 otherwise; put W := 
max{U,U'}, with cumulative distribution function F\y. 

Proposition 7. 



P{AX., + BYi < C, A'Xi + B'Z[ < C7'|Pi, P2, P^ P3) 

< min{P(^Xfc + BYk < C|Pi, P2), P(^'Xfc + B'Z'^ < C'\P[,P!,)} 
= min{l - U,l-U'} = 1- max{U, U'} = 1- W. 




Proof. For each /c > 4 we have 



Therefore 




As in Corollary 6, integrating by parts yields the result. □ 
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2.5. Poisson approximation. The "Chen-Stein method" yields Poisson 
approximations for sums of Bernoulh random variables whose dependence is 
not too large. We will use the following formulation due to Arratia, Goldstein 
and Gordon [1]: 

Theorem 8. Let I be an arbitrary index set. For each a € I, let Za be 
a Bernoulli random variable with P{Za = 1) = 1 — P{Za = 0) = pa > 0, and 
let Ba C I be the ^^neighborhood of dependence" for a; that is, a G B^ and 
Za is independent of Zp for all (3 ^ Ba- Put 

Z := ^ Zq,, 
X:=Y,B{Za) = Y.Pa, 
bi:=J2J2 E(Z,)E(Z^) = E E W' 

a&I l3&Ba ael ^eBa 

62 := E E ^(ZaZfs). 

Then 

\\C{Z) - Poisson(A)|| < 2(61 + b2)^—^ < 2(61 + b2). 

A 

Proof. Theorem 1 in Arratia, Goldstein and Gordon [1], with no "near- 
independence" (i.e., 63 = 63 = 0). □ 

2.6. Notation. We use the following standard notation, all as n — > oo:g{n) ~ 
h(n) for lim.n g{n)/h{n) = 1; g{n) < h{n) for limsup„ (7(n)//i(n) < 1; g{n) ~ 
h{n) for < \imng{n)/h{n) < 00; g{n) = 0{h{n)) for limsup„ (7(n)//i(n) < 
00; and g{n) = o{h[n)) for \\m.n g{n) / h[n) = 0. Also, log is the natural loga- 
rithm logg throughout. 

3. The convex hull of n random points in the plane. Interestingly, the ex- 

(n) 

pectation of 52 is related to the number of vertices, or edges, of the con- 
vex hull K of the n random points in the plane Pi, ■ ■ ■ ,Pn (the connection 
does not, however, extend beyond the first moments). Denote that number 
hy V = V^^^ , and let Vq be the number of edges of K whose outward normal 
is positive.^'' The distribution F is called symmetric if F{—x) = 1 — F{x) for 
all X [or, more generally, if there exists xq such that F(xo — x) = l — F(xo + x) 
for all x]. 



The "outward normal" to an edge of K is perpendicular to the edge and points away 
from the interior of K. 
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Proposition 9. 

2^, = E(yo)>P(l^o>0) = l--. 

n 

Moreover, if F is symmetric, then 

8^1n = 'E{V). 

Proof. Let Eij be the indicator that the hne segment PiPj is an edge 
of K with positive outward normal; then Vq = J2i<j ^ij- Clearly, P{Eij = 
1) = P{Ei2 = 1) = 2P(r) (if the additional condition Xi < X2,Yi > Y2 in 
r is not satisfied, interchange Pi and P2] this yields the factor 2), and so 
E{Vo) = (n(n - l)/2)2P(r) = 2^^„. 

Now Vo = if and only if there is a point Pi that is maximal in both the 
X- and the ^-direction, that is, Xi = maxj Xj and also Yi = maxj Yj. The 
probability of this event is 1/n (letting i be the index where Xi = maxj Xj, 
the probability that Yi = maxjYj is 1/n, since the y's are independent of 
the X's). Therefore, 

E(Vo)>P(V^o>l) = l-^- 

If F is symmetric, the same holds for outward normals in each of the four 
quadrants, and so B{V) = 4E(Vo). □ 

Our main result for the number of vertices V^'^^ is: 

Theorem 10. Let F be a symmetric distribution. Then, as n— >oo.' 

(i) ifF£ £T, then E(l^(")) ^ 00; and 

(ii) ifFGSS, i/ien E(F(")) ^4 and P(y(") =4) ^ 1. 

Proof. Combine Proposition 9 above with results that will be obtained 
in the next two sections: Proposition 12 for (i) and Corollary 20 for (ii). □ 

Some intuition for the interesting result (ii) for "heavy-tailed" distribu- 
tions is provided immediately after the proof of Theorem 19 in Section 5.2.^^ 
Figures^'^ 1 and 2 show, for each one of five different distributions, n = 10,000 



Fisher [8] shows that for certain distributions (including the WeibuU distributions 
with parameter < a < 1) the hmit shape of the normalized convex huU is {{x,y) £ 
R : |a;| + \y\ < 1} — which is the convex hull of four points. However, this does not imply 
that the number of vertices l/*-"' converges to 4, since there may be many vertices close to 
each one of these four points (as is the case for the uniform distribution, where the limit 
shape is the unit square, and V^"-' — > oo). 
^^Generated by maple. 
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Fig. 1. The number of vertices V of the convex hull of n random points drawn from three 
distributions in £T . (a) Uniform distribution: n = 10,000, V = 29. (b) Normal distribu- 
tion: n = 10,000, V = 16. (c) Exponential distribution: n = 10,000, V = 9. 



random points together with their convex hull and the resulting number of 
vertices V^"^^ . In the context of random points drawn from radially symmet- 
ric distributions (rather than independent coordinates), Carnal [4] has shown 
that E(y(")) converges to a constant > 4 for a certain class of heavy-tailed 
distributions (with the constant depending on the distribution). 

We conclude this section with a lemma that is useful when comparing 
distributions (see its use in the next section). 



EVOLUTIONARILY STABLE STRATEGIES 



11 




Fig. 2. The number of vertices V of the convex hull of n random points drawn from 
two distributions in S£ . (a) Weibull distribution, a = 1/2: 71 = 10,000, V = A. (b) Cauchy 
distribution: n = 10,000, V = 4. 



Lemma 11. Let F' and F" be two distributions, with supports^'^ {o.',b') 
and {a" ,b") and corresponding and /u". If there exists a strictly increasing 
convex function if : (a", b") (a', b') such that F"{x) = F'{(p{x)) for all x G 
(a", 6"), then < ^i^^. 

Proof. Let (Xj')i<j<„ and (^/)i<i<n be independent and F'-distributed 
random variables, and define := ip~^{X-) and Y/' := ip~^{Y/). Put P/ = 
{X'i, y/) and Pi' = (Xf , y/'), and let K' and K" be the convex hulls of 
and {Pl'}^, respectively. Since P(Xf <x)= P{ip-^{X'i) < x) = P(X,' < ip{x)) = 
F'{ip{x)) = F"{x), the (Xf ), and (F/Oi are ^''-distributed. If Ax + By is a 
supporting line to K' at Pj with A,B>0, then {Ap)x + {Bq)y is a support- 
ing line to K" at P/', where p,q> are subgradients of if at X'^ and Y- , 
respectively. Therefore Vq + 1 < Vg" -|- 1 (the number of vertices supported 
by positive outward normals is larger by one than the number of edges sup- 
ported by such normals), and so fj.'^ < fi'^. □ 

4. Exponential and faster decreasing tails. 

4.1. The class ET . We define the class of distributions ET with ^''Ex- 
ponential and Faster decreasing tails" as those continuous distributions F 



— oo < a' < 6' < oo and — oo < a" <b" <oo. 
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with support (a, b) (where — oo <a<b< oo) whose "tail" G{x) = 1 — F{x) 
is a log-concave function; that is, G{x) = e~^^^^ where g : (a, b) — > (0, oo) 
is a strictly increasing convex function. The functions G and g = — log G 
are usually called the survival function and the cumulative hazard function, 
respectively; for a collection of results on log-concave probabilities, see Bag- 
noli and Bergstrom [2].^^ A sufficient (but not necessary) condition for the 
log-concavity of G is that the density function f = F' be continuously dif- 
ferentiable and log-concave. Some distributions included in the class £J- are 
the following (for simplicity, we take standard normalizations; replacing x 
with \x + v for any A > and v clearly preserves the log-concavity of G) : 

• Exponential: G{x) = for x S (0,oo). 

• Normal: G{x) = J^{2TT)-^/^e-y^/^ dy for x E (-00,00). 

• Weibull with parameter a>l: G{x) = for x G (0,oo), where a > 1 
(these are the Fa of the Introduction). 

• G{x)=e~'^ for X G (—00, 00). 

• Logistic: G{x) = 1/(1 + e^) for x G (—00, 00). 

• Uniform: G{x) = 1 — x for x G (0, 1). 

Each such distribution is by definition an increasing convex transfor- 
mation of the exponential distribution: if F{x) = 1 — e~^^^\ then F{x) = 
F^^P{g{x)) for every x in the support of F [where F°^p{x) = 1 — e~^ is the 
exponential cumulative distribution function]. By Lemma 11, it thus follows 
that the exponential distribution yields the lower bound on /i^ over the class 
£J^. Now Haigh [11] proved that fi^^ w log log n, and so we have 

Proposition 12. If F £ ET , then ^„ ^ 00 as n ^ 00. 

Proof. If F G EJ=, then /^^ > yS"^^ « log log n — > 00 by Lemma 11 and 
Haigh [11]. □ 

Renyi and Sulanke [13] provide more precise results: ~ -v/Iogn and 

^uniform ^ logn. Also, we note that the class ET can be taken to be much 
larger; see Section 6(b). 

4.2. Poisson approximation. Our Theorem 1 for the class EJ^ is an im- 
mediate consequence of Proposition 12, together with the general result of 
Theorem 13 below (which holds for any distribution F, not necessarily in 
EJ-'). The analysis will also yield the universal upper bound of Theorem 17. 



^''The class of positive random variables with a log-concave G is usually called IFR (for 
Increasing Failure Rate). 
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Theorem 13. For every distribution F, 

||£(S'2"^) — Poisson(;Un)|| = O ( ) as n ^ oo. 



The remainder of this section is devoted to the proof of Theorem 13. 
For every x G M and u £ (0,1), let v{x]u) := P(C/ < u\Xi = x) [recall the 
definition (3) of [/]; then 



(4) Fu{u)= I v{x]u)dF{x) 
and 

(5) Fw{u)= I v{x;ufdF{x), 



since, given Xi = x, the events U <u and U' <u are independent (the first 
depends on Yi,X2,Y2 and the second on ^1,^3,^3; see Section 2.4). 

For every 6 > and u G (0,1), let Hui^) be determined by the equation 
F(X + bY > Ku{b)) = u (it is unique since X + bY is a continuous random 
variable). Let K = Ku := {{x,y) eM."^ : x + by < Ku{b) for all 6 > 0} be the 
set of all points that are not contained in any half-plane of probability u 
with positive normal (see Figure 3). Clearly, if either Pi € K 01 K, then 
U >u [since, for all 6 > 0, the line x + by = c through that point has c < Ku{b) 
and so P(X + bY >c)> P(X + 61" > k„(6)) = u]. The set K is convex (it is 
an intersection of half-spaces) and comprehensive [i.e., {x',y') < {x,y) E K 
implies that {x',y') E K]. Let y = ri{x;u) be the equation of its boundary, 
that is, ri{x;u) := sup{y : (x,y) E K} [with r]{x]u) := —00 when there is no 
such y]. We have: 



Lemma 14. For every x and u E (0, 1) 

v{x] u) < uG{r]{x; u)) < u. 



Proof. Let Pi = (xi,yi). If Pi E K, then, as we saw above, P{U < 
u\Xi = xi,Yi = yi) = 0. 

If Pi ^ K, then yi > ri{xi;u) (again, see Figure 3); let bo = bo{xi,yi) := 
inf{6 > 0:xi -|- byi > Ku{b)}. The function is continuous since the distri- 
bution F is continuous, and so xi + b^yi > k«(6o) [note that we may well 
have 60 = 0, for which k.(i(0) = G~^{u)]. Assume that U <u; then there ex- 
ists 6 > such that X2 + 6I2 = xi + byi > Ku{b), and so 6 > 6o- Now Y2 < yi 
(since U < n < 1); therefore X2 + 60^2 > a^i + 602/1, which, as we saw above, 
is > Ku{bo). Thus U <u implies that P2 lies above the line x + b^y = Ku{bo), 
and so P(C/ < u|Xi = xi, Yi = yi) < P(X2 -|- 60^2 > i^u{bo)) <uhy definition 
of K„. 
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Fig. 3. If U < u, then P2 lies m the darkly shaded area; the probability of the whole 
shaded area is u (Lemma 14). 

Taking expectation over Yi = yi therefore yields 

P{U < u\Xi = xi) < OP(Pi G K\Xi = xi) + nP(Pi ^ K\Xi = xi) 

< uP{Yi > r]{xi;u)) = uG{7]{xi;u)) < u. □ 

Lemma 15. For every x and u G (0, 1) 

G{x)G{r]{x;u)) <u. 



Proof. If Pi = (2:1,^1) ^ K, then (see Figure 4) there exists 6 > such 
that c := xi + byi > Ku{b), and so P(X + bV > c) < u. Therefore, F{X > 
xi,Y> yi) < P{X + bY>c)<u, and so G{xi)G{yi) < u. This holds for all 
2/1 > ri{xi;u), and G is a continuous function. □ 



Combining the inequalities in the last two lemmas yields: 
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Fig. 4. The probability of the whole shaded area is u, and that of the darkly shaded area 
is G{xi)G{yi) (Lemma 15). 



Corollary 16. For every x and u € (0, 1) 

. ^ f 

u{x;u) <mm.\ u,—— 
I G{x) 

From this we can immediately obtain an upper bound on /i„ which appHes 
to any distribution F. This bound is known; see Devroye [6]. 

Corollary 17. For every distribution F, 
( log n \ 

P(r)=0( — —\ and fin = 0{\ogn) asn^oo. 

Proof. Let t be such that G{t) = u. Applying Corollary 16 in the for- 
mula (4) yields 

/oo rt y2 I'co 

u{x;u)dF{x)< ——dF{x)+ udF{x) 
-oo J-oo Cr{x) Jt 
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[ -dz + u^ =u^ floe - + 1 



[we have used the substitution z = G{x)\. Therefore, 

P(r) = (n - 2) /\l - uT~^Fu{u) du 
Jo 

< (n - 2) J\l - uY^^u^ {log ^ + l^du 



<^ + 0(^). 



n2 /• n 



We can now prove Theorem 13. 



Proof of Theorem 13. Let i{u) be determined by G{^{u)) = ^jFu{u). 
For X < £,{u), we wih use the inequahty v{x]u) < /G{x) to get 

v{x]u) dF{x)< / v{x; u) dF{x) 



oo 



G{x) 



< / v{x]u)dF{x) 



n2 



< / i/(x; n) (iF(x) 



n2 



For X > ^(ti), we use u{x;u) <u to get 



Fu{u) = u^^Fu{u). 



v{x;ufdF{x)< / =u2g(C(w)) = u%/Fc/(u). 



Altogether, 

Fw{u) < lu^Fuiu). 

Therefore, 

/' 

Jo 



{l-u)''-''Fwiu)du 



<2j {l-uT-^u'^^Fu{u)du 

rl \ 1/2 / /.I \ 1/2 

\^ (1 - uf-^Fu{u) duj n (1 - u^-^Uu] , 
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by the Cauchy-Schwarz inequality. The first integral is P(r)/(?i — 2) = 
0{nnn~^) [by Corollary 6 and (2)], the second integral is 0{n~^), and so 

[\l - uT-^Fw{u)du = 0{ixll^n-^). 
Jo 

Therefore, by Proposition 7, 

p(rnr') = o(^y2^-3). 

We now apply Theorem 8 to 52™^ = J2i<j^ij- There are n(n — l)/2 = 
O(n^) terms Dij; the neighborhood of dependence of each Dij consists of 
Djfc and Dj/^ for all k, and so it is of size 2n — 3 = 0{n). Therefore, 

6, = 0{n^)0{n)P{Tf = 0{n\^Lr,n-^f) = 0{filn-') 

and 

62 = o(n2)o(n)p(r n r') = o(^y2). 

This yields ||£(52"^) - Poisson(^,„)|| < 2{bi + 62)//^n = 0{finn~^ + /-fn^^^)- 
Now E(y(")) = 0(logn), and so ^„ = O(logn), for any distribution F; this 

follows from Theorem 1 (for dimension d = 2) in [6]. Therefore [[/^(Sg"^) — 

/2 

Poisson(/i„) II = 0(/in )• □ 

Proof of Theorem 1. Combine Proposition 12 and Theorem 13. □ 
5. Slower than exponential decreasing tails. 

5.1. The class S£. We define the class of distributions S£ with ^'Slower 
than Exponential decreasing tails" as those distributions F with support 
(a,oo) (where a > — oo) whose tail G = 1 — F satisfies the following two 
conditions: 

[SEl] Subexponentiality" : 

P(X+ + Y+ > t) ~ 2G(i) ast^oo, 

where X, Y are independent F-distributed random variables and := 
max{Z, 0}; and 
[SE2] ''Uniformity": For ah c> 1, 

(6) G{ct) > G{ty as t — > oo, uniformly as c ^ 1^; 

that is, for every e > there exist to = tQ{e) and cq = co(e) > 1 such 
that G{ct)/G{tf >l-£ for all t > to and all c e (1, cq). 
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Distributions satisfying [SEl] are called subexponential distributions (orig- 
inally introduced by Chistyakov [5]). Some examples are (see Table 3.7 in 
Goldie and Kliippelberg [9]; again, we use standard normalizations for sim- 
plicity): 

• Regularly varying tails: G{x) = x~"i{x), where a > and ^ is a slowly 
varying function, that is, limx^oo£{cx)/£{x) = 1 for every c > 0. This in- 
cludes: 

- Pareto: G{x) = x~" for x G (1, cxo), where a > 0. 

- Cauchy: G{x) = /~(7r(l -hy^))"! = arctan(x)/7r + 1/2 for x G (0, oo). 

- a-stable, where < a < 2. 

• Lognormal: G{x) = /^(\/27ry)^^e~^°s ^^"^ dy for (0, oo). 

• Weibull with parameter < a < 1: G{x) = for x G (0, oo). 

• ^''Almost'''' exponential: G{x) = e~^^^^^^ " for x G (l,oo), where a > 0. 

(However, the exponential distribution does not satisfy [SEl].) 

As for condition [SE2], in terms of the cumulative hazard function g{t) : = 
— logG(t), it says that for every e > there exist to = toi^) and cq = co(e) > 1 
such that g{ct) < cg{t) + e for all t> t^ and all c G (l,co). Therefore, a 
sufficient condition for [SE2] is that g{t)/t be a nonincreasing function for 
large enough^^ t; this is the case when g is concave (and so G is log- convex; 
contrast with SJ-), or even star-concave^^ (we will see in Lemma 18(ii) below 
that [SEl] implies that g{t) /t — > as t ^ oo) . It is now easy to verify that all 
the distributions listed above also satisfy [SE2]. Finally, S£ is closed under 
"tail equivalence": if 1 — F{t) ~ 1 — F'{t) as t ^ oo, then F G S£ if and only 
if F' G S£ (for [SEl], see Theorem 3 in Teugels [14]). 

The next lemma collects a number of properties that will be used in the 
proof below. 

Lemma 18. Let F satisfy [SEl]. Then: 
(i) 

(7) F{X + Y>t)<2G{t) ast^oo. 

(ii) g{t) :=- log G(t) = o{t) ast^oo. 

(iii) There exist 7t > such that 

(8) lim -ft = 0, 

(9) lim 7jt = oo, 

t—fOO 

(10) lim G(t)^* = 1. 

t— >oo 



^^The class of positive random variables where g{t)/t is a nonincreasing function for 
all t is usually called DFRA (for Decreasing Failure Rate Average). 
^^That is, g{\x) > Xg{x) + (1 - X)g{0) for all x > and all < A < 1. 
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Moreover, if F also satisfies [SE2], then 

G{t) = 

Proof, (i) is immediate from [SEl] since X + Y < X+ + 1+. As for (ii), 
it is a well-known property of subexponential distributions (e.g., it follows 
from (1.4) in Goldie and Kliippelberg [9]). To get (iii), take, for example, ■jt = 
1/ \Jtg{t), and then (8), (9) and (10) immediately follow from (ii); finally, 
(10) together with (6) imply (11). □ 

5.2. First moment. In this section we will prove that, for distributions 
in S8, the expected number of two-point ESS converges to 1/2, and the 
number of vertices of the convex hull converges in probability to 4. Some 
intuition is provided after the proof of Theorem 19. The main result is 

Theorem 19. Let F G S8. Then 

P(r) ~ — ^ and /in — > - asn— >oo. 

2 

As a result, the number of vertices V^"^^ of the convex hull of n random 
points satisfies 

Corollary 20. Let F he a symmetric S8 distribution. Then 
E(y("))^4 and P(y(") = 4)^l asn^oo. 

Proof. Theorem 19 and Proposition 9 yield E(Vb) 1 and P(Vb = 
0) = 1/n ^ 0, and so P(Vb / 1) ^ 0. The resuh follows since V = 4Vb- □ 

Thus, for symmetric S£ distributions, the probability that the convex 
hull is a quadrilateral converges to 1. 

For the remainder of this section we assume that F G S8. 
The proof of Theorem 19 uses the following result: 

Proposition 21. Asu^Q 

Fu{u) < \u\ 



Before proving Proposition 21 (to which most of this section is devoted), 
we use it to prove Theorem 19. 
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Proof of Theorem 19. Given e > 0, let 5 > be such that Fu{u) < 
(1 + e)n^/2 for all u < 6. We will use Corollary 6, and separate the integral 
into two parts. For the first part, we have 

[\l - uY^'^Fuiu) du<{l + e)\ [\l - n)"-3n2 du 
Jo 2 Jo 

<(l + e)- f\l-ur-\'^du 
2 JO 

= (! + £) 



2n(n- l)(n- 2) 
As for the second part, we get 

\l - uf~^Fu{u) du<[l- 5Y~^, 

which is less than, say, e/n^ for all n large enough. Adding the two bounds, 
multiplying by n — 2, and recalling Corollary 6 yields P(r) < (1 + 2e) /n^ for 
all n large enough. The opposite inequality is in Proposition 9 [recall (2)]. 
□ 



The proof of Proposition 21 requires careful analysis. To get some in- 
tuition, consider the convex hull of n random points Pi, P2, ■ ■ ■ , Pn- Let 
Pi = (Xi,Yi) be the (a.s. unique) point with maximal X-coordinate, that 
is, Xi = maxfc Xfc. An essential property of sub exponential distributions is 
that Xi is much larger than all the other X^ for k^i. In addition, the corre- 
sponding y-coordinate, namely Yi, is also much smaller than Xi. The same 
holds for the point Pj = (Xj,Yj) with maximal y-coordinate, which implies 
that, with high probability, all the points P^ with k^i,j will lie well below 
the line connecting Pi and Pj, so that Pi and Pj will be the only vertices 
with positive outward normals. This basic picture can be seen in Figure 5 
(recall also Figure 2). The points in the region have large X (bigger than 
an appropriate t), whereas the width of (in the y-coordinate) is small 
relative to t. The same holds for the region L^, with X and Y interchanged. 
These two regions will thus "catch," with high probability, the points Pi and 
Pj with maximal X and maximal Y, respectively. 

Fix < e < 1, and let t = t^^e be such that 

(12) G{t) = {l+e)u- 

then u ^ is equivalent to t — > cx) (since e > is fixed). We will say that t 
and u correspond to one another if they are related by (12). Next, we define 
the following sets in (see Figure 5): 

L'' = Ll^,:={{x,y):x<t,y<t,x + y<t}, 
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Fig. 5. is the whole shaded area and L'^ (see Lemma 29) is the darkly shaded area. 



L^ = Lle-={ix^y)-\^\<ltt,y>t}, 

L^^Lly.= L\{L'uL^). 

The reader should keep in mind that t as well as all the sets L,L^, 
depend on u {and e). 

For simplicity, we will write F{L) instead of P(Pi G L). 

Lemma 22. Asu^O, 

P{Lu,s) = 0{u). 

Proof. 

P(L) <P{x + y>t) + P{x >t) + P{y > t) < 4G(t) = 4(1 + e)u. 
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□ 

Lemma 23. There exists uq = UQ{e) G (0, 1) such that, for all u < uq, if 
a, b, c satisfy a,b> and P{aX + bY > c) < u (where X, Y are independent 
and F -distributed), then c/a> t^^e o,nd c/b > tu,e- 

Proof. Assume without loss of generality that a>b. If c/a <t = tu^^, 
then 

P{aX + bY>c)> P{aX + bY> at) 

>P{\Y\<-/tt,X>{l + -ft)t) 
= H{jtt)G{il+^t)t), 

where H{z) := 1 — F{—z) — G{z) for z > (we used a > 6 in the second 
inequality). Now H{-ftt)G{{l + -ft)t)/G{t) ^ 1 as t ^ oo by (9) and (11), 
and so H{'jtt)G{{l + 'jt)t) > G{t)/{1 + e) = u for all t large enough, or all u 
small enough. This contradiction shows that indeed c> at >bt. □ 

Corollary 24. For all u<uq, 

P{PieLl, orP2eLl„U<u) = 0. 

Proof. If U <u < uq, then the entire set L*^ lies below the line Ax + 
By = C through Pi and P2 [this holds for its two extreme points, (f, 0) and 
(0,t), by Lemma 23, and A,B>^]; therefore U < u implies that Pi ^ 
and P2 i LP. □ 

At this point we immediately get the following bounds: 

Proposition 25. 

P{U <u) = 0{u^) asu^Q, 
fJ'n = 0{l) as CO. 

Proof. Corollary 24 and Lemma 22 imply that P{U <u) < P(Pi, P2 S 
L) = P(iv)^ = 0{u'^) as n ^ 0. Using this in the computation of the proof of 
Theorem 19 yields P(r) = ©(l/n^), and so Hn = 0{l). □ 

To get /Ltn < 1/2 will require a more refined analysis (the best constant we 
can get up to this point is /in ^ 4) . We start with a useful inequality: 

Lemma 26. Let X and Y be independent and F -distributed. Then, for 
every a, b,c,9 > 0, 

(13) PiaX + bY>c)>H(e^)GUl + e)^) + h (o^^gUi + 9)^ 
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Proof. We have 

P{aX + bY>c)> P{aX > (1 + e)c, \bY\ < Oc) 

+ P{\aX\<ec,bY >{i + e)c). □ 

The next three lemmas will deal, respectively, with the three cases: (i)-Pi G 
and P2 £ -Z^^; (ii) Pi^Pi^ or Pi,P2 G i^; and (iii) Pi,P2 S (recall 
Corollary 24). The corresponding probabilities turn out to be of the order 
of n^/2 in the first case, and o(n^) in the other two cases. 



Lemma 27. As^' u^O, 

P(Pi G Ll^„ P2 G Ll^„ U<u)<^u^ + eO{u^). 



Proof. Let Pi G and P2 G L"^ be such that U <u<uo, where is 
given by Lemma 23, and let to correspond to uq. The line P1P2 is Ax + By = 
C with A = Yi-Y2, B = X2- Xi and C = X2Y1 - X1Y2. Since U <u<uq, 
we have C /A> t and C/P > t by Lemma 23 and so, taking = 7^ in Lemma 
26, 



U = P{AX + BY > C\Pi,P2)> H{-itt) 



G((l + 7t)^)+G((l + 7t)§) 



Now X2,Yi > t and |Xi|,|y2| < Itt, and so |Xi| < 7^X2 and \Y2\ < jtYi, 
which implies that A > Yi(l - -ft),B > X2{1 - -ft) and C < X2Yi{l + jf). 
Therefore, 

^<X.i±^ and §<r.l±^^ 

A 1 - 7t P 1 - 7t 

Put pt := (1 + 7t)(l + 7|)/(1 — 7t) > 1; (8) implies that pt 1, and so from 
(6) it follows that there is ti > to large enough so that G{ptz)/G{zY^ > 
(1 + for all^^ z>t>ti. Therefore, for all t > ti, we have 

U>H{jtt)[G{ptX2) + G{ptYi)] 

> H{jtt){l + e)-'/^[G{X2r + G{Yi)P^] 

> Hijtt){l + e)-V22i-P* [G{X2) + G{Yi)r ■ 



= £0{u^) as It ^ 0" means that there exists a constant AI < 00 such that 
hmu_o/(ii,e)/?i^ < sM for every e G (0, 1) [or, equivalently, for every e G (0, 1) there exists 
u = u{e) > such that f{u, £)/u^ < eM for all w < w]. 

^**Indeed, given e > 0, let zo{e) and co(e) be such that G{cz)/G{zY > (1 + e)"^/^ for 
all z > zq and c £ (1, co); take ti > such that pt < cq for all t>ti. 
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Now H{^tt)2^~^* ^ 1 by (9); therefore, there exists t2 > ti such that H{'^tt) x 
> (1 + e)-i/2 foj. ah t > t2, and so 

U>{l + er'[G{X2)+G{Yi)r. 
Let U2 correspond to t2', then U <u <U2 imphes that 

(14) GiX2) < ((1 + e)uf'' - GiYi) = Gitf'"' - G{Y^). 

Equation (14) provides a lower bound on X2, and so 

P(P2 eL^,U < u\Pi) < P(X2 satisfies (14)|yi) < G{tf/P' - G{Yi). 
Integrating over Yi in (t,cx)), we have 

{G{tf'p^-G{yi))dF{y^) 

= G{tY+^IP' -\G{tf, 
since jrG{y^)dFiy,) = - G{y^) dG{y^) = -[G{yif /2]r = G{tf /2. Now 
l/pt = (1 - 7t)/((l + lt){l + ih) > 1 - 27t, and so G(t)i+VPt < ^(i)'-'^* ~ 
G{t)^ by (10), which implies that there is > t2 such that 
G(t)i+iM < (1 + e/2)G{tf for ah t > tg. This yields 



P(Pi G L\P2 eL^,U <u)<[l+ 2 jG(t)2 - -G(t)2 

= ]^{l + e)G{tf 

= \{l + efu' 

<i(l + 7e)n2 
for all t>t^. Now let 7x3 correspond to ^3. □ 

Lemma 28. Asu^Q, 

P(Pi, P2 G f/ < ix) = P(Pi, P2 G in,,, ?7 < n) = eO(n2). 

Proof. Let -Pi, P2 £ -^^^ be such that U <u < uq, where uq is given by 
Lemma 23, and let to correspond to uq. Then Yi>Y2> t and — 7tt < Xi < 
X2 < 7tt, and also C/B > t (by Lemma 23); therefore, 

C _ X2Y1 - X1Y2 ^ ittYi - {-^tt)Y2 _ Y1 + Y2 
A Y1-Y2 - Y1-Y2 Y1-Y2' 
from which it follows by Lemma 26 with 6 = ^t that 

U > hU^\g{{1 + 7i)^) > H{^tt)G{{l + lt)ltt^y^ 
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Let ti > to be large enough so that H{'^t't) > V(l + for all t > ti, and let 
ui correspond to ti; then U <u <ui implies that 

G ((1 + Ithtt^^) < (1 + e)u = Git); 

thus 
or 

Yi < ptY2, 

where now pt := (1 + 7t(l + 74))/ (1 - 7t(l + 7t)) > 1 and pt 1. Therefore, 
for P2£L\ 

P(Pi G L\?7 < n|P2) < P{Y2 <Yi< ptY2\Y2) 
= G{Y2) - G{ptY2) 
<GiY2)-il-e)G{Y2r, 

the last inequality holding for all t large enough, say t > t2 > ti, again by 
(6). Integrating over Y2 in (t,oo) yields 

;>oo 

FiPiGL\P2eL\U<u)< {G{y2)-{l-e)G{y2r)dF{y2) 

= \G{tf-{l-e)-^G{tY+PK 
2 1 + Pt 

Now l + pt~2 + 27t ^2, and so G{t)^+P^ / {I + pt) G{tf /2 [recah (10)]. 
Therefore, there is > t2 such that, for all t > ta, 

P(Pi G L\P2 G L\ [/ < u) < - (1 - 2e)iG(t)2 

= eG(t)2 = e(l + efu^ < Aeu^. 

The case where Pi,P2 G -^^^ is the same (interchange X and Y). □ 

Lemma 29. n ^ 0, 

P(L3,) = eO(n). 

Proof. Define := {(x, y) : x + y > t}\(L^ U L^) (see Figure 5); then 
P(L^) < P(x + y > t) - P(|x| < 7tt, y > (1 + 7t)i) 
-P(|y| <7tt,x>(l + 7t)t). 



26 S. HART, Y. RINOTT AND B. WEISS 

Let ti be large enough so that P{x + y > t) < {2 + e)G{t) and also H{jtt)G{{l + 
7t)t) > (1 -e)G(t) for all t > h [recall (7), (9) and (11)], and thus 

P(L^) < (2 + e)G{t) - 2(1 - e)G(t) = 3eG{t); 

therefore, 

P(L3) < P(l4) + P{x < -^tt, y>t) + F{x>t,y< -jtt) 
< 3eG{t) + 2F{--ftt)G{t) < 4eG{t) = 4e(l + e)u 

[note that F{--ftt) < 1 - H{-ftt) < e for t > ti]. □ 

Corollary 30. As u^Q, 

P(Pi i Ll, or P2 i L2 „ \J<u) = eOin"). 

Proof. For u small enough, 
P(Pi^Li or Pa ^ i^ C/ < -a) 

+ P(Pi G L^, P2 G L, ?7 < u) + P(Pi G L, P2 G i^ < u) 
+ P(Pi, P2 G f/ < n) + P(Pi, P2 G ^^ f/ < n) 
< + 2P(l3)P(L) + 2eO(u2) = 2eO{u)0(u) + 2eO('u2) 

by Corollary 24 and Lemmas 29, 22 and 28. □ 

Proof of Proposition 21. Adding up the estimates of Lemma 27 
and Corollary 30 yields P(C/ < u) < (1/2)^^ + eOiu^) as u ^ 0. This holds 
for every e G (0,1) and the left-hand side is independent of e, and so^^ 
V{U <u)< (l/2)n2 + o(n2) . □ 

5.3. Second moment and Poisson approximation. Recall Section 2.4 and 
Proposition 7 there. 

Proposition 31. Asn^oo, 

p(rnr') = o(?i-^). 



^^Formally, there exists M < oo such that hmsup^_^Q ?([/ < u)/v?' < 1/2 + Me for all 
e, and so limsup^_Q P((7 <u)/u'^ < 1/2. 
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Proof. Proposition 32 below will show that P{W <u) = o{u^). Thus, 
given e > 0, there is (5 > such that F{W <u) < ev? for all u < 6; then, as 
in the proof of Theorem 19, 

f\l - n)"-^FvK(^x) du<{l- <5)'^-^ + e f\l- n)'^-V du = eO{n-^). 
Jo Jo 

Multiplying by n — 3 and recalling that e > was arbitrary shows that indeed 

p(rn r') = o(n-3). □ 

It remains to show that: 

Proposition 32. As u^O, 

P{W <u) = o{u^). 

Proof. Fix e G (0, 1). First, we have 

P(Pi €L\P2eL^,U <u,P{€L\Pf^€L'^,U' < u) 

< P{Yi >t,X2> t, Zi >t,X3>t) = G{ty = O(n^). 

Next, for all u small enough [i.e., u < u(e)], 

P(Pi ^ or P2 ^L^,U < u, U' < u) 

< P(Pi ^ or P2 (^L^,U< u, P^eL)= eO{u^)0{u) = eO{u'^) 

by Corollary 30 and Lemma 22, and the fact that P3 = {X^,Z^) is indepen- 
dent of Pi = and P2 = {X2,Y2)). Similarly, we have 

P(P/^Li or P!^iL^,U' <u,U <u)=eO{u^). 

Adding up the two terms yields P(VF < n) < £0{u^) for every e G (0, 1), or 
V{W <u)<o{u^). □ 

We can now prove Theorem 2. 

Proof of Theorem 2. Again, we apply Theorem 8 to Z]i<j Aj - We 
have 

hi = 0{n'^)0{n)V{Tf = 0(l/n), 

62 = o(n2)o(n)P(r n r') = 0(1), 

by Theorem 19 and Proposition 31, and so ||£(S'2'^'') — Poisson(/i„,)|| < 2{hi + 
62) = 0(1). Now Poisson(/i„) converges to Poisson(l/2) since /i„ 1/2 by 
Theorem 19. □ 

Recall (Proposition 4) that the number of one-point (pure) ESS, 

converges in distribution to Poisson(l) as n — > cx). While ^i"^ and are 
not independent, we will now show that, nevertheless, their sum converges 
to Poisson(3/2). 
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Theorem 33. Put S^^^ := 5}^ + Sf' .IfFeSS, then 

as n —> oo. 

Proof. We apply again Theorem 8, this time to J2i Ci + J2i<j ^ij- Let 

b'l and 62 correspond to 52^^ = Z]i<j ^ij'i ™ the proof of Theorem 2 above we 
showed that b'l = 0{l/n) and 63 = The additional dependencies now are 
between a term Cj and a term Dij, with the same i and j ^ i- However, we 
have ^(CiDij) = P(Cj = Dij = 1) = 0, since Q = 1 implies that Ra > Rij, 
whereas Dij = 1 implies that Ra < Rij (see Lemma 3). Thus 62 = ^2 ~ ^(1)) 
and 

= 0(l/n + n{l/nf + n'^{l/n){l/n'^)) = 0(l/n). 

Theorem 8 yields \\C{S^^^) - Poisson(l + /i„)|| < 2{bi + 62) = o(l); and we 
have ! + 3/2 by Theorem 19. □ 

Corollary 34. If F £ S£, then the probability that there is an ESS 
with support of size < 2 converges to 1 — e~'^^'^ ~ 0.78 as 00. 

6. Discussion. We conclude with a discussion of some of the related liter- 
ature, together with a number of comments, conjectures and open problems. 

(a) Vertices and equilibria. The connection between Nash equilibria and 
vertices of random polytopes was used by Barany, Vempala and Vetta [3] 
to find Nash equilibria in random games. Concerning ESS, we emphasize 
again that the number of vertices of a random polygon and the number 
of two-point ESS of a random game have different distributions; only their 
expectations are related (by a factor of 8; see Proposition 9). 

(b) The class ET . The class of distributions with "Exponential and Faster 
decreasing tails" for which Theorem 1 holds can clearly be taken to be larger 
than that of Section 4. Indeed, since Theorem 13 holds for any distribution, 
we can include in ET any F such that /i„ ^ 00. Take, for example, those 
distributions in Fisher [8] for which the limit shape of the convex hull is a 
strictly convex set; this implies that the number of vertices, and thus /i^, 
must go to infinity. By Theorem 1 there, this includes distributions where, for 
some a > 1, the tail probability G = 1-F satisfies G-^l/^) ~ cV°G-i(l/t) 
as t ^ cxD for each c S (0, 1). 

(c) The probability of having an ESS. For distributions in EJ^, Theorem 
l(iii) implies that the probability that there is an ESS converges to 1 as 
n increases. For distributions in SE, however, it is still unknown what the 
limit of this probability is. Some preliminary informal analysis suggests to 
us the following conjectures: if F G SE, then, as n ^ 00: 
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• 5p ^Poisson(l/3); 

• S'^^^ := + 5^"^ + 5^ ^ ^ Poisson(l + 1/2 + 1/3) = Poisson(ll/6); 

• Sf"^ ^ for all £ > 4; 

• ELi-^f^ ^Poisson(ll/6); 

• P (there is an ESS) ^l-e-"/6 -0.84 <1. 

[The geometric objects corresponding to S^"^ are now the {i— 1) -dimensional 
faces of the convex hull of n random points in M^.] 

(d) Threshold phenomenon. Our distributions exhibit a "threshold" phe- 
nomenon: either /i„ ^ oo or fin 1/2- However, we believe that one may 
construct distributions for which the sequence /i„ has other limit points, 
or even oscillates wildly as n increases. Indeed, for each n, the number of 
vertices, and thus /i^, depends on the distribution F only through a certain 
interval of its tail [in a neighborhood of G~^{l/n)]. Therefore, one should be 
able to "glue" various tails (of the SJ^ or S£ types) and get different limit 
points. See Devroye [7] for such oscillations in the case of radially symmetric 
distributions. 

(e) Other distributions. It would be interesting to study additional classes 
of distributions. For example, bounded-support distributions whose tail G is 
not log-concave are not included in £J^; we conjecture that — > oo in this 
case, though perhaps the convergence is at a slower rate than the logn of 
the uniform distribution. Another question arises when the distributions of 
the X-coordinates and of the Y-coordinates differ (but, say, they are both 
in or both in S£); this concerns also the number of vertices when the 
distribution is not symmetric (consider the different orthants). 
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